[Feature] compare for 2 diff models#4356
Conversation
|
Codex usage limits have been reached for code reviews. Please check with the admins of this repo to increase the limits by adding credits. |
for more information, see https://pre-commit.ci
Summary of ChangesHello, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request significantly enhances the model comparison capabilities by introducing a flexible, generalized system. Users can now select and compare any two models, including GGUF, LoRA, and standard models, side-by-side. The system intelligently detects if a LoRA is loaded, defaulting to the existing fast comparison method, while providing a new, robust sequential loading and generation process for general comparisons, complete with user feedback and optimizations. This broadens the utility of the comparison feature beyond just LoRA models, making it a more versatile tool for model evaluation. Highlights
Changelog
Activity
Using Gemini Code AssistThe full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips. Invoking Gemini You can request assistance from Gemini at any point by creating a comment using either
Customization To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a Limitations & Feedback Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for GitHub and other Google products, sign up here. You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension. Footnotes
|
There was a problem hiding this comment.
Code Review
This pull request introduces a generalized model comparison feature, allowing side-by-side comparison of any two models, which is a significant enhancement. The implementation correctly preserves the existing LoRA fast-path compare and adds a new sequential load-and-generate flow for the general case. The backend is optimized to avoid redundant model reloads, and the frontend provides good user feedback with progress toasts. My review includes a few suggestions for improving code clarity and robustness, such as removing an unused variable, refining exception handling, and simplifying a redundant expression.
for more information, see https://pre-commit.ci
… sizing (#4369) Co-authored-by: imagineer99 <samleejackson0@gmail.com>
|
great work! |
* compare for 2 diff models * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * resolving gemini comments * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * fix(studio): refine model-load toast stop action and compare selector sizing (unslothai#4369) Co-authored-by: imagineer99 <samleejackson0@gmail.com> --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Daniel Han <danielhanchen@gmail.com> Co-authored-by: imagineer99 <samleejackson0@gmail.com>
Added generalized model comparison that lets users compare any two models side-by-side, while preserving the existing fast LoRA compare (simultaneous, adapter toggle) as an auto-detected fast path. When a LoRA is loaded, compare works exactly as before; otherwise, a new general compare mode renders per-side model pickers and orchestrates sequential load → generate → load → generate with progress toasts, an "already loaded" backend optimization to skip redundant reloads, immediate user message display on both sides, and a locked send button until both sides complete. Supports GGUF, LoRA, and standard models.